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Content of this Talk 

• Real-world problems from game dev 

• Small problems, that is, and easy to state 

• Actual solutions used in shipping games 

• Using math that's not too advanced 


Strategies for finding elegant solutions 



GAME DEVELOPERS CONFERENCE® 2015 


MARCH 2-6, 2015 GDCONF.COM 


Occlusion boxes 

• Plain boxes put in world as occluders 

• Extrude away from camera to form occluded 
region of space where objects don't need to be 
rendered 


How to do this most efficiently? 
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Camera 
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Occlusion boxes 

• Could classify box faces as front/back 
and find silhouette edges 
• Similar to stencil shadow technique 


A better solution accounts for small 
solution space 
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Occlusion boxes 

• There are exactly 26 possible silhouettes 

• Three possible states for camera position on 
three different axes 

• position < box min 

• position > box max 

• box min < position < box max 

• Inside box excluded 
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condition 

code 

x > xmax 

0x01 

x < xmin 

0x02 

y > ymax 

0x04 

y < ymin 

0x08 

z > zmax 

0x10 

z < zmin 

0x2 0 
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Finite classifications 





Marching Cubes, fixed polarity 

(256 cases, 18 classes) 
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Transvoxel Algorithm 

(512 cases, 73 classes) 
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Occlusion boxes 

• Calculate camera position state and use 
table to get silhouette 


Always a closed convex polygon with 
exactly 4 or 6 vertices and edges 
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Occlusion boxes 



// Upper 3 bits = vertex count, lower 5 bits = polygon index 
const unsigned_int8 occlusionPolygonlndex [43] = 


0x00, 

0x80, 

0x81, 

0x00, 

0x82, 

0xC9, 

0xC8 , 

0x00, 

0x83, 

0xC7 , 

0xC6, 

0x00, 

0x00, 

0x00 

0x84, 

OxCF, 

OxCE, 

0x00, 

OxDl, 

0xD9, 

0xD8, 

0x00, 

OxDO, 

0xD7 , 

0xD6, 

0x00, 

0x00, 

0x00 

0x85, 

OxCB, 

OxCA, 

0x00, 

OxCD, 

0xD5, 

0xD4 , 

0x00, 

OxCC, 

0xD3, 

0xD2 





} ; 

// All 26 polygons with vertex indexes from diagram on left 
const unsigned_int8 occlusionVertexIndex [26] [ 6] = 

{ 


{1, 

3, 

7, 

5} 

, 


(2, 

o. 

4, 

6} 

, 


{3, 

2, 

6 , 

7} 

, 


(0, 

1 , 

5 , 

4} 

, 


{4, 

5, 

7, 

6} 

, 


{1, 

o. 

2, 

3} 

, 


(2, 

o. 

1 , 

5, 

4, 

6} 

{0, 

1 , 

3, 

7, 

5, 

4} 

{3, 

2, 

o. 

4, 

6, 

7} 

U, 

3, 

2 , 

6, 

7, 

5} 

{1, 

o. 

4, 

6, 

2, 

3} 

{5, 

1 , 

o. 

2, 

3, 

7} 

{4, 

o. 

2 , 

3, 

1 , 

5} 

(0, 

2, 

6 , 

7, 

3, 

1} 

{0, 

4, 

5 , 

7, 

6, 

2} 

{4, 

5 , 

1 , 

3, 

7, 

6} 

{1, 

5, 

7, 

6, 

4, 

0} 

{5, 

7, 

3, 

2, 

6, 

4} 

(3, 

1 , 

5, 

4, 

6, 

2} 

(2, 

3, 

7, 

5, 

4, 

0} 

H» 

o, 

4, 

6, 

7, 

3} 

{0, 

2, 

6 , 

7, 

5, 

1} 

{ 7 , 

6, 

2, 

o. 

1 , 

5} 

{6, 

4, 

o. 

1, 

3, 

7} 

{5, 

7, 

3, 

2, 

o. 

4} 

{4, 

5, 

1 , 

3, 

2, 

6} 


0x00, 

0x00, 


0x00 

0x00 
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Occlusion boxes 

• Any silhouette edge that is off screen can be 
eliminated to make occlusion region larger 

• Gives occluder infinite extent in that direction 


Allows more objects to be occluded because 
they must be completely inside extruded 
silhouette to be hidden 
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Occlusion boxes 

• Silhouette edge is culled if both vertices on 
negative side of some frustum plane 


And extruded plane normal and frustum plane 
normal have positive dot product 
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Camera 
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Occlusion boxes 

• Strategy: 


Look for ways to classify solutions 
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Oblique near plane trick 

• Sometimes need a clipping plane 
for a flat surface in scene 

• For example, water or mirror 

• Prevent submerged objects from 
appearing in reflection 
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Ordinary frustum 


Oblique near plane 
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Oblique near plane trick 

• Hardware clipping plane? 

• May not even be supported 

• Requires shader modification 

• Could be slower 
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Oblique near plane trick 


• Extra clipping plane almost 
always redundant with 
near plane 



Don't need to clip to both 
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Oblique near plane trick 

• Possible to modify projection matrix 

• Move near plane to arbitrary location 


No extra clipping plane, no redundancy 
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Oblique near plane trick 

• In normalized device coordinates (NDC), 
near plane has coordinates (0,0, 1,1) 


t 


Z = 1 


( 0 , 0 , 1 , 1 ) 

A 


Z = ~ 1 
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Oblique near plane trick 

• Planes (row antivectors) are transformed from 
NDC to camera space by right multiplication by 
the projection matrix 

• So the plane (0, 0, 1, 1) becomes 

M 3 + M 4/ where M, is the /'- th row of the 
projection matrix 



GAME DEVELOPERS CONFERENCE® 2015 


MARCH 2-6, 2015 GDCONF.COM 


Oblique near plane trick 

• M 4 must remain (0, 0, -1, 0) so that 
perspective correction still works right 

• Let C = (C x/ C y , C z , C w ) be the camera-space 
plane that we want to clip against 

• Assume C w < 0, camera on negative side 


We must have C = M 3 + (0, 0, -1, 0) 
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Oblique near plane trick 

• M 3 = C - M 4 = (C x; Cy, C z + 1, C w ) 


M = 


e 

0 

C x 

0 


0 

e/a 

Cy 

0 


0 

0 

C z + 1 
-1 


0 

0 

c 

0 


w 


This matrix maps points on the plane C 
to the plane z = -1 in NDC 



GAME DEVELOPERS CONFERENCE® 2015 


MARCH 2-6, 2015 GDCONF.COM 


Oblique near plane trick 

• But what happens to the far plane? 

. F = M 4 - M 3 = 2M 4 - C 

• Near plane and (negative) far plane differ 
only in the z coordinate 

• Thus, they must coincide where they 
intersect the z = 0 plane 
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Oblique near plane trick 
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Oblique near plane trick 

• Far plane is a complete mess 

• Depths in NDC no longer represent distance 
from camera plane, but correspond to some 
skewed direction between near and far planes 


We can minimize the effect, 
and in practice it's not so bad 
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Oblique near plane trick 

• We still have a free parameter: 
the clipping plane C can be scaled 

• Scaling C has the effect of changing the 
orientation of the far plane F 

• We want to make the new view frustum as 
small as possible while still including the 
conventional view frustum 
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Oblique near plane trick 

• Let F = 2M 4 - aC 

• Choose the point Q which lies furthest opposite 
the near plane in NDC: 

Q = M _1 (sgn(C*),sgn(C y ),l,l) 

• Solve for a such that Q lies in plane F: a = M4 A ^ 

CaQ 



GAME DEVELOPERS CONFERENCE® 2015 


MARCH 2-6, 2015 GDCONF.COM 


Oblique near plane trick 

• Near plane doesn't move, but far plane 
becomes optimal 
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Oblique near plane trick 

• Works for any perspective projection matrix 

• Even with infinite far depth 

• More analysis available in " Oblique Depth 
Projection and View Frustum Clipping ". Journal 
of Game Development, Vol. 1, No. 2. 
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Oblique near plane trick 

• Strategy: 


Get the big picture 
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Fog bank occlusion 

• Consider fog bank bounded by plane 

• Linear density gradient with increasing depth 

• Perpendicular to plane 

• Zero density at plane 
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Fog bank occlusion 

F 


FaP 

P 
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Fog bank occlusion 
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Fog bank occlusion 

• For a given camera position, we want to cull 
objects that are completely fogged 


Surface beyond which objects are completely 
fogged is interesting... 
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(video) 
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Fog bank occlusion 

• Culling against that curve is impractical 

• Instead, calculate its maximum extent parallel 
to the fog plane 


Then cull against plane perpendicular to fog 
plane and camera view direction at that distance 
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Fog bank occlusion 
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Fog bank occlusion 

• F = fog plane, normal outward 

• C = camera position 

• P = point being shaded 

. V = C - P 

• a = linear density coefficient 
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Fog bank occlusion 


Density as function of depth: 


/?(P) = -a(F aP) 


Log light fraction g( P) for given C and P: 


.+ 


g(P) = -a||V 


FaP+FaC 


+ See " Unified Distance Formulas for Halfspace Foq ", Journal of Graphics Tools, Vol. 12, No. 2 (2007). 
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Fog bank occlusion 

• Set g( P) to log of small enough fraction to be 
considered fully fogged 

• For example: g( P) = log(l/256) 


This is constant 


GAME DEVELOPERS CONFERENCE® 2015 


MARCH 2-6, 2015 GDCONF.COM 


Fog bank occlusion 

• For given C, we need to find P with the 
maximum horizontal distance d from C that also 
satisfies 



FaP+FaC 


g( p ) = -a 


2 
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Fog bank occlusion 

• So express d as a function of P and find the 
zeros of the derivative, right? 

• Turns out to be a huge mess 

• Not clear that good solution exists 
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Fog bank occlusion 

• Insight: instead of using independent variable P ; 
express F a P as a fraction of F a C 


FaP = /(FaC) 




0 + 1)(FaC) 


2 


GAME DEVELOPERS CONFERENCE® 2015 


Fog bank occlusion 


F 

A 



/(FaC) 

(1-/)(FaC) 



d 


MARCH 2-6, 2015 GDCONF.COM 


d 2 +(/-l ) 2 (F aC ) 2 


FaC 
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Fog bank occlusion 


• Can now write g( P) as follows 

8 (P) = ~ it + 1)(F a C)^ 2 +(/-1) 2 (FaC) 2 

• And solve for d 2 \ 



m 2 

(/ + 1) 2 (F aC) 2 


-(7-l) 2 (F aC) 2 


111 — 


(P) 


a 
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Fog bank occlusion 

• Take derivative, set to zero, simplify: 

2 

t 4 + 2t 3 -2t + k-l = 0 k= - 2 

(F a C ) 4 

• Now need to solve quartic polynomial 
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Fog bank occlusion 

• Know what your functions look like! 



• Always k at t = 1, local min at t = 1/2 
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Fog bank occlusion 

• If function is negative at t = 1/2, 
then solution exists 

• Happens exactly when k < 27/16 

• Tempting to calculate with closed-form solution 
to quartic 

• But almost always better to use 
Newton's method, especially in this case 
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Fog bank occlusion 

• Newton's method: t i+l - 


1 

0.75 

0.5 

0.25 


- 0.25 

- 0.5 
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m 

m 
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Fog bank occlusion 

• In our case, 

f'(t) = 4t 3 +6t 2 -2 

• Start with t 0 = 1: 

/(!) = * 

/'( 1 ) = 8 


GAME DEVELOPERS CONFERENCE® 2015 


MARCH 2-6, 2015 GDCONF.COM 


Fog bank occlusion 

• Calculate first iteration explicitly: 



Newton's method converges very quickly 
with 1-2 more iterations 
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Fog bank occlusion 


• Plug t back into function for d 2 to get 
culling distance 



m 2 

o + l ) 2 (FaC ) 2 


-(Y-1) 2 (F aC) 2 


If d 2 > 0 when t = 0, possible larger 
culling distance 
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Fog bank occlusion 

• Solution exists at t = 0 when k > 1 

• Solution exists deeper when k < 27/16 


Take the larger distance if both exist 
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Fog bank occlusion 


ML I] 
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k <1 
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Fog bank occlusion 

• Strategy: 

Eliminate variables 

Know what functions look like 

Embrace Newton's method 
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Contact 

• lengyel@terathon.com 

• http : //www .terathon .com/lenq vel/ 


@EricLengyel 
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Supplemental slides 

• Bezier animation curves 

• Floor and ceiling functions 

• Cross product trick 

• Bit manipulation tricks 
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Bezier animation curves 

• Another note about Newton's method 

• Cubic 2D Bezier curves often used to animate 
some component of an object's transform 


Position x, y, z or rotation angle, for example 
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Bezier animation curves 

• 2D coordinates on curve are time t and 
some scalar value v 

• t is not the parameter along the curve 


Big source of confusion in data exchange 
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Bezier animation curves 

• Control points specified in ( t , v ) space 


Time coordinates increase monotonically 
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Track View - Curve Editor 


i 


[I Editor Edit View Curves Keys Tangents Show 


% Objects 
[§]lKChain-LArm 
(UJTransfbrm 
|T~| Swivel Angle 
□IK Goal 
frfl Position 
□ X Position 
B~| Y Position 
n z Position 
□Rotation 
FI X Rotation 
IF~1 Y Rotation 
□ Z Rotation 
□ Scale 
□Enabled 
/!? Object (Object) 
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Bezier animation curves 

• To evaluate the value of a curve P(s) 

at a given time t, it's necessary to find the 
parameter s along the curve for which P t (s ) = t 

• Requires solving a cubic polynomial 


Newton's method perfect for this case 
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Bezier animation curves 


• For details, see Track structure 
in OpenGEX Specification 

• openaex.ora 


Open Game Engine Exchange 
Specification 

V«3Hn i-ij 



EKLtnwd 
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Floor and ceiling 

• Not all CPUs have floating-point 
floor/ceil/trunc/ round instructions 

• Need to implement with ordinary math 


Needs to be fast, no FP/int conversions 
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Floor function 

• 32-bit float has 23 bits in mantissa 

• Thus, any value greater than or equal to 2 23 is 
necessarily an integer 


No bits left for any fractional part 
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Floor function 

• Trick is to add and subtract 2 23 

• The addition causes all fraction bits 
to be shifted out the right end 


The subtraction shifts zeros back into the space 
previously occupied by the fraction 
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Floor function 

• When we add 2 23 , the original number is 
rounded to the nearest integer + 2 23 


If result is greater than original number, then 
simply subtract one to get floor 
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Floor function 

• What about negative numbers? 

• Use the same trick, but subtract 2 23 first, and 
then add it back 

• Can combine for all possible inputs 
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Floor function 

ml 2 8 f loor ( ml28 x) 


ml28 one = { 0x3F800000 } ; 

ml28 two23 = { 0x4B000000 } ; 

ml28 f = _mm_sub_ps (_mm_add_ps ( f , two23) , 

f = _mm_add_ps (_mm_sub_ps (x, two23) , two23) 
f = _mm_sub_ps ( f , _mm_and_ps (one, _mm_cmplt 
return (f) ; 


two23) ; 
ps (x, f ) ) ) ; 


} 
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Floor function 

• But wait, this fails for some very large inputs 
(bigger than 2 23 ) 

• All of these inputs are already integers! 

• They must be if they're bigger than 2 23 
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Floor function 

• So just return the input if it's > 2 23 

ml 2 8 sgn = {0x80000000}; 

ml 2 8 msk = _mm_cmplt_ps ( two23 , _mm_andnot_ps (sgn, x) ) 

f = mm or ps ( mm andnot ps (msk, f ) , mm and ps (msk, x) ) ; 


• ^ 


GAME DEVELOPERS CONFERENCE® 2015 


MARCH 2-6, 2015 GDCONF.COM 


Ceiling function 

• Instead of subtracting one if result is greater 
than input, add one if result is less than input 


f 


mm add ps(f, mm and ps (one, mm cmplt ps (f , x) ) ) ; 



GAME DEVELOPERS CONFERENCE® 2015 


MARCH 2-6, 2015 GDCONF.COM 


Floor and ceiling 

• Strategy: 


Reduce problem domain 
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Cross product trick 

• Cross product V x w given by: 

V.yzx * W.zxy - W.zxy * V.yzx 


Two mults, one sub, four shuffles 
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Cross product trick 

• Can do this instead: 

(V * W.yzx - V.yzx * W).yzx 

• Two mults, one sub, three shuffles 

• And same shuffle each time 
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Bit manipulation tricks 

• Range checks 

• Non-branching calculations 

• Logic formulas 
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Integer range checks 

• Integer range checks can always be done with a 
single comparison: 


(unsigned) (x - min) <= (unsigned) (max - min) 
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Non-branching calculations 

• Using logic tricks to avoid branches in 
integer calculations 

• Many involve using sign bit in clever way 

• Also useful to know -x == ~x + l 
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Non-branching calculations 

• Helps scheduling, increases ILP 

• Reduces pollution in branch history table 

• But can obfuscate code 

• Use where performance is very important 

• Don't bother elsewhere 
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Clever uses of sign bit 

• if (a < 0 ) ++x; 

• Replace with: 


• X 


= a » 31; 


// 32-bit ints 
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Right-shifting negative integers 

• Shifting n-bit int right by n - 1 bits: 

• All zeros for positive ints (or zero) 

• All ones for negative ints 


C++ standard says a » 31 is "implementation- 
defined" if a is negative 
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Right-shifting negative integers 

• Any sensible compiler generates instruction that 
replicates sign bit 

• To avoid issue in this case, could also use: 


• x += (uint32 ) a >> 31 
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Predicates for 32-bit signed ints 


(X 

== 0) 

lzcnt (x) 

» 5 


(X 

!= 0) 

( lzcnt (x) 

» 5) 

A 

(X 

< 0) 

(uint 32 ) 

x » 31 

(X 

> 0) 

(uint32 ) 

-x » 

31 

(X 

== y) 

lzcnt (x - 

y) » 

5 

(X 

!= y) 

(uint32 ) 

( (x - 

y) 


(y - x) ) » 31 


IzcntQ is leading zero count 



GAME DEVELOPERS CONFERENCE® 2015 


MARCH 2-6, 2015 GDCONF.COM 


Absolute value 

• y = x » 31 

• abs (x) = (x A y) - y 

• Because -x = ~x + l 


x A OxFFFFFFFF - OxFFFFFFFF 


GAME DEVELOPERS CONFERENCE® 2015 


MARCH 2-6, 2015 GDCONF.COM 


Conditional negation 

• Same trick can be used to negate 
for any bool condition: 


• if (condition) x = -x; 


• X = (x 


A 


-condition) + condition 
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Logic Formulas 


Formula | Operation / Effect | Notes 


x & (x - 1) 

Clear lowest 1 bit. If result is 0, then x is 2 n . 

X I (X + 1) 

Set lowest 0 bit. 

X I (X - 1) 

Set all bits to right of lowest 1 bit. 

X & (x + 1) 

Clear all bits to right of lowest 0 bit. If result is 0, then x is 2 n 

X & -X 

Extract lowest 1 bit. 

~x & (x + 1) 

Extract lowest 0 bit (as a 1 bit). 

~x | (X - 1) 

Create mask for bits other than lowest 1 bit. 

X I ~(x + 1) 

Create mask for bits other than lowest 0 bit. 

X I -X 

Create mask for bits left of lowest 1 bit, inclusive. 

x A -X 

Create mask for bits left of lowest 1 bit, exclusive. 

~x | (X + 1) 

Create mask for bits left of lowest 0 bit, inclusive. 

~x A (X + 1) 

Create mask for bits left of lowest 0 bit, exclusive. Also x = (x + 1). 

x A (X - 1) 

Create mask for bits right of lowest 1 bit, inclusive. 0 becomes -1. 

~x & (x - 1) 

Create mask for bits right of lowest 1 bit, exclusive. 0 becomes -1. 

x A (X + 1) 

Create mask for bits right of lowest 0 bit, inclusive. remains -1. 

X & (~x - 1) 

Create mask for bits right of lowest 0 bit, exclusive. remains -1. 


This table from "Bit Hacks for Games", Game Engine Gems 2 . A K Peters, 2011. 
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